Cross Lingual Query Dependent Snippet Generation
نویسندگان
چکیده
The present paper describes the development of a cross lingual query dependent snippet generation module. It is a language independent module, so it also performs as a multilingual snippet generation module. It is a module of the Cross Lingual Information Access (CLIA) system. This module takes the query and content of each retrieved document and generates a query dependent snippet for each retrieved document. It highlights all the query words, which appear in the generated snippet. The algorithm of this module based on the sentence extraction, sentence scoring and sentence ranking. Subjective evaluation has been done to evaluate the output of this module. English snippet got the best evaluation score, i.e. 1 and overall average evaluation score of 0.83 has been achieved in the scale of 0 to 1. Keywords— Snippet Generation, Summarization, Information Extraction, Information Retrieval, Cross Lingual, Multilingual.
منابع مشابه
Bridging the Gap between Intrinsic and Perceived Relevance in Snippet Generation
Snippet generation plays an important role in a search engine. Good snippets provide users a good indication on the main content of a search result related to the query and on whether one can find relevant information in it. Previous studies on snippet generation focused on selecting sentences that are related to the query and to the document. However, resulting snippet may look highly relevant...
متن کاملRanking of Resulting Objects and Snippet Generation
Semantic web search engine Falcons support keyword based search for linked objects by using comprehensive virtual document which it creates for each object. In our work we are suggesting idea of using Selectivity Estimation of triple patterns for ranking of resulting objects and generating snippet for the keyword query for Falcons Semantic web search engine. Selectivity of a triple pattern is t...
متن کاملPseudo-relevance feedback and statistical query expansion for web snippet generation
a r t i c l e i n f o a b s t r a c t A (page or web) snippet is a document excerpt allowing a user to understand if a document is indeed relevant without accessing it. This paper proposes an effective snippet generation method. A statistical query expansion approach with pseudo-relevance feedback and text summarization techniques are applied to salient sentence extraction for good quality snip...
متن کاملxLiD-Lexica: Cross-lingual Linked Data Lexica
In this paper, we introduce our cross-lingual linked data lexica, called xLiD-Lexica, which are constructed by exploiting the multilingual Wikipedia and linked data resources from Linked Open Data (LOD). We provide the cross-lingual groundings of linked data resources from LOD as RDF data, which can be easily integrated into the LOD data sources. In addition, we build a SPARQL endpoint over our...
متن کاملParsing the Wiki Collection and Snippet Generation A THESIS SUBMITTED TO THE FACULTY OF THE GRADUATE SCHOOL OF THE UNIVERSITY OF MINNESOTA BY Sai Subramanyam Chittilla IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF MASTER OF SCIENCE
Information Retrieval (IR) is a field which deals with retrieving useful information from large sets of data in response to a query. Much information in this digital age is stored in XML format, which associates a structure with a document. Though IR systems have been used for years to access documents, the field has greatly expanded with the emergence of the world wide web, which emphasizes th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012